168 research outputs found

    Diversity of Lactase Persistence Alleles in Ethiopia:Signature of a Soft Selective Sweep

    Get PDF
    The persistent expression of lactase into adulthood in humans is a recent genetic adaptation that allows the consumption of milk from other mammals after weaning. In Europe, a single allele (-13910(∗)T, rs4988235) in an upstream region that acts as an enhancer to the expression of the lactase gene LCT is responsible for lactase persistence and appears to have been under strong directional selection in the last 5,000 years, evidenced by the widespread occurrence of this allele on an extended haplotype. In Africa and the Middle East, the situation is more complicated and at least three other alleles (-13907(∗)G, rs41525747; -13915(∗)G, rs41380347; -14010(∗)C, rs145946881) in the same LCT enhancer region can cause continued lactase expression. Here we examine the LCT enhancer sequence in a large lactose-tolerance-tested Ethiopian cohort of more than 350 individuals. We show that a further SNP, -14009T>G (ss 820486563), is significantly associated with lactose-digester status, and in vitro functional tests confirm that the -14009(∗)G allele also increases expression of an LCT promoter construct. The derived alleles in the LCT enhancer region are spread through several ethnic groups, and we report a greater genetic diversity in lactose digesters than in nondigesters. By examining flanking markers to control for the effects of mutation and demography, we further describe, from empirical evidence, the signature of a soft selective sweep

    World-wide distributions of lactase persistence alleles and the complex effects of recombination and selection

    Get PDF
    The genetic trait of lactase persistence (LP) is associated with at least five independent functional single nucleotide variants in a regulatory region about 14 kb upstream of the lactase gene [-13910*T (rs4988235), -13907*G (rs41525747), -13915*G (rs41380347), -14009*G (rs869051967) and -14010*C (rs145946881)]. These alleles have been inferred to have spread recently and present-day frequencies have been attributed to positive selection for the ability of adult humans to digest lactose without risk of symptoms of lactose intolerance. One of the inferential approaches used to estimate the level of past selection has been to determine the extent of haplotype homozygosity (EHH) of the sequence surrounding the SNP of interest. We report here new data on the frequencies of the known LP alleles in the 'Old World' and their haplotype lineages. We examine and confirm EHH of each of the LP alleles in relation to their distinct lineages, but also show marked EHH for one of the older haplotypes that does not carry any of the five LP alleles. The region of EHH of this (B) haplotype exactly coincides with a region of suppressed recombination that is detectable in families as well as in population data, and the results show how such suppression may have exaggerated haplotype-based measures of past selection

    Identifying high-impact variants and genes in exomes of Ashkenazi Jewish inflammatory bowel disease patients

    Get PDF
    Inflammatory bowel disease (IBD) is a group of chronic digestive tract inflammatory conditions whose genetic etiology is still poorly understood. The incidence of IBD is particularly high among Ashkenazi Jews. Here, we identify 8 novel and plausible IBD-causing genes from the exomes of 4453 genetically identified Ashkenazi Jewish IBD cases (1734) and controls (2719). Various biological pathway analyses are performed, along with bulk and single-cell RNA sequencing, to demonstrate the likely physiological relatedness of the novel genes to IBD. Importantly, we demonstrate that the rare and high impact genetic architecture of Ashkenazi Jewish adult IBD displays significant overlap with very early onset-IBD genetics. Moreover, by performing biobank phenome-wide analyses, we find that IBD genes have pleiotropic effects that involve other immune responses. Finally, we show that polygenic risk score analyses based on genome-wide high impact variants have high power to predict IBD susceptibility

    Metabolites of milk intake: a metabolomic approach in UK twins with findings replicated in two European cohorts

    Get PDF
    Purpose: Milk provides a significant source of calcium, protein, vitamins and other minerals to Western populations throughout life. Due to its widespread use, the metabolic and health impact of milk consumption warrants further investigation and biomarkers would aid epidemiological studies.  Methods: Milk intake assessed by a validated food frequency questionnaire was analyzed against fasting blood metabolomic profiles from two metabolomic platforms in females from the TwinsUK cohort (n = 3559). The top metabolites were then replicated in two independent populations (EGCUT, n = 1109 and KORA, n = 1593), and the results from all cohorts were meta-analyzed.  Results: Four metabolites were significantly associated with milk intake in the TwinsUK cohort after adjustment for multiple testing (P < 8.08 × 10−5) and covariates (BMI, age, batch effects, family relatedness and dietary covariates) and replicated in the independent cohorts. Among the metabolites identified, the carnitine metabolite trimethyl-N-aminovalerate (β = 0.012, SE = 0.002, P = 2.98 × 10−12) and the nucleotide uridine (β = 0.004, SE = 0.001, P = 9.86 × 10−6) were the strongest novel predictive biomarkers from the non-targeted platform. Notably, the association between trimethyl-N-aminovalerate and milk intake was significant in a group of MZ twins discordant for milk intake (β = 0.050, SE = 0.015, P = 7.53 × 10−4) and validated in the urine of 236 UK twins (β = 0.091, SE = 0.032, P = 0.004). Two metabolites from the targeted platform, hydroxysphingomyelin C14:1 (β = 0.034, SE = 0.005, P = 9.75 × 10−14) and diacylphosphatidylcholine C28:1 (β = 0.034, SE = 0.004, P = 4.53 × 10−16), were also replicated.  Conclusions: We identified and replicated in independent populations four novel biomarkers of milk intake: trimethyl-N-aminovalerate, uridine, hydroxysphingomyelin C14:1 and diacylphosphatidylcholine C28:1. Together, these metabolites have potential to objectively examine and refine milk-disease associations

    Whole Exome Sequencing of HIV-1 long-term non-progressors identifies rare variants in genes encoding innate immune sensors and signaling molecules

    Get PDF
    Abstract Common CCR5-∆32 and HLA alleles only explain a minority of the HIV long-term non-progressor (LTNP) and elite controller (EC) phenotypes. To identify rare genetic variants contributing to the slow disease progression phenotypes, we performed whole exome sequencing (WES) on seven LTNPs and four ECs. HLA and CCR5 allele status, total HIV DNA reservoir size, as well as variant-related functional differences between the ECs, LTNPs, and eleven age- and gender-matched HIV-infected non-controllers on antiretroviral therapy (NCARTs) were investigated. Several rare variants were identified in genes involved in innate immune sensing, CD4-dependent infectivity, HIV trafficking, and HIV transcription mainly within the LTNP group. ECs and LTNPs had a significantly lower HIV reservoir compared to NCARTs. Furthermore, three LTNPs with variants affecting HIV nuclear import showed integrated HIV DNA levels below detection limit after in vitro infection. HIV slow progressors with variants in the TLR and NOD2 pathways showed reduced pro-inflammatory responses compared to matched controls. Low-range plasma levels of fibronectin was observed in a LTNP harboring two FN1 variants. Taken together, this study identified rare variants in LTNPs as well as in one EC, which may contribute to understanding of HIV pathogenesis and these slow progressor phenotypes, especially in individuals without protecting CCR5-∆32 and HLA alleles

    The Origins of Lactase Persistence in Europe

    Get PDF
    Lactase persistence (LP) is common among people of European ancestry, but with the exception of some African, Middle Eastern and southern Asian groups, is rare or absent elsewhere in the world. Lactase gene haplotype conservation around a polymorphism strongly associated with LP in Europeans (−13,910 C/T) indicates that the derived allele is recent in origin and has been subject to strong positive selection. Furthermore, ancient DNA work has shown that the −13,910*T (derived) allele was very rare or absent in early Neolithic central Europeans. It is unlikely that LP would provide a selective advantage without a supply of fresh milk, and this has lead to a gene-culture coevolutionary model where lactase persistence is only favoured in cultures practicing dairying, and dairying is more favoured in lactase persistent populations. We have developed a flexible demic computer simulation model to explore the spread of lactase persistence, dairying, other subsistence practices and unlinked genetic markers in Europe and western Asia's geographic space. Using data on −13,910*T allele frequency and farming arrival dates across Europe, and approximate Bayesian computation to estimate parameters of interest, we infer that the −13,910*T allele first underwent selection among dairying farmers around 7,500 years ago in a region between the central Balkans and central Europe, possibly in association with the dissemination of the Neolithic Linearbandkeramik culture over Central Europe. Furthermore, our results suggest that natural selection favouring a lactase persistence allele was not higher in northern latitudes through an increased requirement for dietary vitamin D. Our results provide a coherent and spatially explicit picture of the coevolution of lactase persistence and dairying in Europe

    SARS-CoV-2-related MIS-C: a key to the viral and genetic causes of Kawasaki disease?

    Get PDF

    Whole-genome sequencing reveals host factors underlying critical COVID-19

    Get PDF
    Critical COVID-19 is caused by immune-mediated inflammatory lung injury. Host genetic variation influences the development of illness requiring critical care1 or hospitalization2–4 after infection with SARS-CoV-2. The GenOMICC (Genetics of Mortality in Critical Care) study enables the comparison of genomes from individuals who are critically ill with those of population controls to find underlying disease mechanisms. Here we use whole-genome sequencing in 7,491 critically ill individuals compared with 48,400 controls to discover and replicate 23 independent variants that significantly predispose to critical COVID-19. We identify 16 new independent associations, including variants within genes that are involved in interferon signalling (IL10RB and PLSCR1), leucocyte differentiation (BCL11A) and blood-type antigen secretor status (FUT2). Using transcriptome-wide association and colocalization to infer the effect of gene expression on disease severity, we find evidence that implicates multiple genes—including reduced expression of a membrane flippase (ATP11A), and increased expression of a mucin (MUC1)—in critical disease. Mendelian randomization provides evidence in support of causal roles for myeloid cell adhesion molecules (SELE, ICAM5 and CD209) and the coagulation factor F8, all of which are potentially druggable targets. Our results are broadly consistent with a multi-component model of COVID-19 pathophysiology, in which at least two distinct mechanisms can predispose to life-threatening disease: failure to control viral replication; or an enhanced tendency towards pulmonary inflammation and intravascular coagulation. We show that comparison between cases of critical illness and population controls is highly efficient for the detection of therapeutically relevant mechanisms of disease

    SARS-CoV-2 susceptibility and COVID-19 disease severity are associated with genetic variants affecting gene expression in a variety of tissues

    Get PDF
    Variability in SARS-CoV-2 susceptibility and COVID-19 disease severity between individuals is partly due to genetic factors. Here, we identify 4 genomic loci with suggestive associations for SARS-CoV-2 susceptibility and 19 for COVID-19 disease severity. Four of these 23 loci likely have an ethnicity-specific component. Genome-wide association study (GWAS) signals in 11 loci colocalize with expression quantitative trait loci (eQTLs) associated with the expression of 20 genes in 62 tissues/cell types (range: 1:43 tissues/gene), including lung, brain, heart, muscle, and skin as well as the digestive system and immune system. We perform genetic fine mapping to compute 99% credible SNP sets, which identify 10 GWAS loci that have eight or fewer SNPs in the credible set, including three loci with one single likely causal SNP. Our study suggests that the diverse symptoms and disease severity of COVID-19 observed between individuals is associated with variants across the genome, affecting gene expression levels in a wide variety of tissue types
    corecore